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Abstract 

We take two new approaches to design efficient algorithms for transmitter optimization under rate 
constraints to guarantee the Quality of Service in general MIMO interference networks, named B-MAC 
Networks, which is a combination of multiple interfering broadcast channels (BC) and multiaccess 
channels (MAC). Two related optimization problems, maximizing the minimum of weighted rates under 
a sum-power constraint and minimizing the sum-power under rate constraints, are considered. The first 
approach takes advantage of existing efficient algorithms for SINR problems by building a bridge between 
rate and SINR through the design of optimal mappings between them so that the problems can be 
converted to SINR constraint problems. The approach can be applied to other optimization problems 
as well. The second approach employs polite water-filling, which is the optimal network version of 
water-filling that we recently found. It replaces almost all generic optimization algorithms currently 
used for networks and reduces the complexity while demonstrating superior performance even in non- 
convex cases. Both centralized and distributed algorithms are designed and the performance is analyzed 
in addition to numeric examples. 
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I. Introduction 

A. System Setup and Problem Statement 

We study the optimization under rate constraints for general multiple-input multiple-output (MIMO) 
interference networks, named MIMO B-MAC networks HI, where each transmitter may send data to 
multiple receivers and each receiver may collect data from multiple transmitters. Consequently, the 
network is a combination of multiple interfering broadcast channels (BC) and multiaccess channels 
(MAC). It includes BC, MAC, interference channels, X channels HI, IS, X networks H and most 
practical wireless networks, such as cellular networks, WiFi networks, DSL, as special cases. We assume 
Gaussian input and that each interference is either completely cancelled or treated as noise. A wide range 
of interference cancellation is allowed, from no cancellation to any cancellation specified by a valid binary 
coupling matrix of the data links. For example, simple linear receivers, dirty paper coding (DPC) Q at 
transmitters, and/or successive interference cancellation (SIC) at receivers may be employed. 

Two optimization problems are considered for guaranteeing the Quality of Service (QoS), where each 
data link has a target rate. The feasibility of the target rates can be solved by a feasibility optimization 
problem (FOP) which maximizes the minimum of scaled rates of all links, where the scale factors are 
the inverse of the target rates. All target rates can be achieved simultaneously if and only if the optimum 
of FOP is greater than or equal to one. FOP can be used in admission control. If the target rates are 
feasible, the system tries to operate at minimum total transmission power in order to prolong total battery 
life and to reduce the total interference to other networks by solving the sum power minimization problem 
(SPMP) under the rate constraints. 

We study both centralized and distributed optimizations. The centralized optimization with global 
channel state information (CSI) provides an upper bound of the performance and a stepping stone to the 
design of the distributed optimization algorithms. In some cases such as cooperative cellular networks, it 
is possible to obtain global CSI if the base stations are allowed to exchange CSI, making the centralized 
optimization relevant. In ad hoc or large networks, we have to design distributed optimization algorithms 
with local CSI. 

B. Related Works 

The SINR version of FOP and SPMP under SINR constraints have been well studied for various 
cases, e.g., |[6l- |[T0l using SINR duality ifTTI - lfTil . which means that if a set of SINRs is achievable in 
the forward links, then the same SINRs can be achieved in the reverse links when the set of transmit 
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and receive beamforming vectors are fixed. Tiius, optimizing the transmit vectors of the forward links is 
equivalent to the much simpler problem of optimizing the receive vectors in the reverse links. However, 
these algorithms lack the following. 1) They cannot be directly used to solve FOP and SPMP under 
rate constraints because the optimal number of beams for each link and the power/rate allocation over 
these beams are unknown; 2) Except for |9|, interference cancellation is not considered; 3) The optimal 
encoding and decoding order when interference cancellation is employed is not solved. 

Considering interference cancellation and encoding/decoding order, the FOP and SPMP for MIMO 
BC/MAC have been completely solved in ||T5| by converting them to convex weighted sum-rate max- 
imization problems for MAC. The complexity is very high because the steepest ascent algorithm for 
the weighted sum-rate maximization needs to be solved repeatedly for each weight vector searched by 
the ellipsoid algorithm. A high complexity algorithm that can find the optimal encoding/decoding order 
for MISO BC/SIMO MAC is proposed in |[T6l that needs several inner and outer iterations. A heuristic 
low-complexity algorithm in |[T6l finds the near-optimal encoding/decoding order for SPMP by observing 
that the optimal solution of SPMP must be the optimal solution of some weighted sum-rate maximization 
problem, in which the weight vector can be found and used to determine the decoding order. 

C. Contribution 

In summary, the FOP and SPMP for MIMO B-MAC networks have been open problems. The 
contribution of the paper is as follows. 

• Rate-SINR Conversion: One of the difficulties of solving the problems is the joint optimization of 
beamforming matrices of all links. One approach is to decompose a link to multiple single-input 
single-output (SISO) streams and optimize the beamforming vectors through SINR duality, if a bridge 
between rate and SINR can be built to determine the optimal number of streams and rate/power 
allocation among the streams. In Section IIV-AI we show that any Pareto rate point of an achievable 
rate region can be mapped to a Pareto SINR point of the achievable SINR region through two 
optimal and simple mappings that produce equal rate and equal power streams respectively. The 
significance of this result is that it offers a method to convert the rate problems to SINR problems. 

• SINR based Algorithms: Using the above result, we take advantage of existing algorithms for SINR 
problems to solve FOP and SPMP under rate constraints in Section IIV-BI and provide optimality 
analysis in Section IIV-CI 

• Polite Water-filling based Algorithms: Another approach is to directly solve for the beamforming 
matrices. For the convex problem of MIMO MAC, steepest ascent algorithm is used except for the 
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special case of sum-rate optimal points, where iterative water-filling can be employed |[T7l - |[T9l . The 
B-MAC network problems are non-convex in general and thus, better algorithms, like water-filling, 
than the steepest ascent algorithm is highly desirable. However, directly applying traditional water- 
filling is far from optimal |[20l - |[22l . In IH, we recently found the long sought optimal network 
version of water-filling, polite water-filling, which is the optimal input structure of any Pareto rate 
point, not only the sum-rate optimal point, of the achievable region of a MIMO B-MAC network. This 
network version of water-filling is polite because it optimally balances between reducing interference 
to others and maximizing a link's own rate. The superiority of the pohte water-filling is demonstrated 
for weighted sum-rate maximization in HI and the superiority is because it is hard not to obtain 
good results when the optimal input structure is imposed to the solution at each iteration. In Section 
IIV-DI using polite water-filling, we design an algorithm to monotonically improve the output of 
the SINR based algorithms for iTree networks defined later, if the output does not satisfy the KKT 
condition. Furthermore, in Section IIV-E[ purely polite water-filling based algorithms are designed 
that have faster convergence speed. 

• Distributed Algorithm: In a network, it is highly desirable to use distributed algorithms. The polite 
water-filling based algorithm is well suited for distributed implementation, which is shown in Section 
IIV-F[ where each node only needs to estimate/exchange the local CSI but the performance of each 
iteration is the same as that of the centralized algorithm. 

• Optimization of Encoding and Decoding Orders: Another difficulty is to find the optimal encod- 
ing/decoding order when interference cancellation techniques like DPC/SIC are employed. Again, 
polite water-filling proves useful in Section IIV-GI because the water-filling levels of the links can 
be used to identify the optimal encoding/decoding order for BC/MAC and pseudo-BC/MAC defined 
later. 

The rest of the paper is organized as follows. Section|n]defines the achievable rate region and formulates 
the problems. Section HIH summarizes the preliminaries on SINR duality and polite water-filling. Section 
HVl presents the efficient centralized and distributed algorithms. The performance of the algorithms is 
verified by simulation in Section |Vl The conclusion is given in Section |Vll 

II. System Model and Problem Formulation 

A. Definition of the Achievable Rate Region 

We consider a MIMO B-MAC interference network, consisting of multiple interfering BCs and MACs. 
There are L data links. Let Ti and Ri denote the virtual transmitter and receiver of link / equipped with 
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transmit antennas and L/^, receive antennas respectively. The received signal at Ri is 

L 

yi = ^H;,fcXfe+Wi, (1) 

k=l 

where G C^^^ ^ ^ is the transmit signal of link k and is assumed to be circularly symmetric complex 
Gaussian; H; ^ G C^"'^^^*; is the channel matrix between Tf^ and Rf, and G C^"'^^ is a circularly 
symmetric complex Gaussian noise vector with zero mean and identity covariance matrix. 

To handle a wide range of interference cancellation possibilities, we define a couphng matrix $ G M^^^ 
as a function of the interference cancellation scheme HI. It specifies whether interference is completely 
cancelled or treated as noise: if x^, after interference cancellation, still causes interference to x/, ^ = 1 
and otherwise, ^i^k = 0. For example, if the virtual transmitters (receivers) of several links are associated 
with the same physical transmitter (receiver), interference cancellation techniques such as dirty paper 
coding (successive decoding and cancellation) can be applied at this physical transmitter (receiver) to 
improve the performance. 

The coupling matrices valid for the results of this paper are those for which there exists a transmission 
and receiving scheme such that each signal is decoded and possibly cancelled by no more than one 
receiver. Possible extension to the Han-Kobayashi scheme, where a common message is decoded by 
more than one receiver, is discussed in IH. We give some examples of valid coupling matrices. For a BC 
(MAC) employing DPC (SIC) where the link is the l^^ one to be encoded (decoded), the coupling matrix 
is given by ^ = 0, VA; < Z and k = 1, VA; > I. In Fig. [TJ we give an example of a B-MAC network 
employing DPC and SIC. When no data is transmitted over link 4 and 5, the following 
are valid coupling matrices for link 1, 2, 3 under the corresponding encoding and decoding orders: a. xi 
is encoded after X2 and X2 is decoded after X3; b. X2 is encoded after xi and X2 is decoded after X3; c. 
xi is encoded after X2 and X3 is decoded after X2; d. There is no interference cancellation. 
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Note that when DPC and SIC are combined, an interference may not be fully cancelled under a specific 
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Figure L Example of a B-MAC network 



encoding and decoding order. Such case cannot be described by the coupling matrix of O's and I's defined 
above. But a valid coupling matrix can serve for an upper or lower bound. See more discussion in m. 

If not explicitly stated otherwise, achievable regions in this paper refer to the following. Note that 
^1,1 = by definition. The interference-plus-noise covariance matrix of the link is 

L 

ni = I + Y,^l,kill,k'Skillk, (2) 

k=l 

where is the covariance matrix of x^. We denote all the covariance matrices as 

Ei.L = (Si,5]2,...,5]i). (3) 
Then the achievable mutual information (rate) of link I is given by a function of 5]i:L and $ 

X;(5^1;L,*) = log|l + H;,zS;H[,^-l|. (4) 

Definition 1: The Achievable Rate Region with a fixed coupling matrix $ and sum power constraint 
Pt is defined as 

n^iPr) = U {rGR^: (5) 

Si:L:Ef=i Tr(S, )<Pt 

ri<Iii'Ei.,L,^),l<l<L}. 

A bigger achievable rate region can be defined by the convex closure of IJ*eH^* (^t)> where H is 
a set of valid coupling matrices. For example, if DPC and/or SIC are employed, H can be a set of valid 
coupling matrices corresponding to various valid encoding and/or decoding orders. 

The algorithms rely on the duahty between the forward and reverse links of a B-MAC network. The 
reverse links are obtained by reversing the transmission direction and replacing the channel matrices 
by their conjugate transposes. The coupling matrix for the reverse links is the transpose of that for the 
forward links. We use the notation "to denote the corresponding terms in the reverse links. For example. 
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in the reverse links of the B-MAC network in Fig. [TJ T1/T2 {R2/R3) becomes the receiver (transmitter), 
and X2 is decoded after xi and X3 is encoded after X2, if in the forward links, xi is encoded after X2 
and X2 is decoded after X3. The interference-plus-noise covariance matrix of reverse link I is 

L 

{li = I + ^^kM,i'^kilk,i, (6) 



fc=l 

T 



and the rate of reverse link I is given by X; ( $ I = log 



B. Problem Formulation 



This paper concerns the feasibility optimization problem (FOP) and the sum power minimization 
problem (SPMP) under Quality of Service (QoS) constraints in terms of target rates [X^*] ^ for a 
B-MAC network with a given valid coupling matrix 

FOP: max mm 7^ (7) 



L 

s.t. ^ 0,/ = l,...,Land ^Tr(Si) < Pt, 

1=1 

where Pt is the total power constraint; 

L 

SPMP: min ^Tx{T.i) (8) 

1=1 

s.t. ii{i:i.,L,^)>ii,'Siho,i = i,...,L. 

In FOP, if the optimum of the objective function is a and is greater than one, the target rates are feasible. 
The optimum input covariance matrices achieves a point on the boundary of the achievable region along 
the direction of vector [X^*] ^, i.e., the optimal rate vector satisfies [X^];^^ l = [X^''] ^. If the 
target rates is feasible for some power, SPMP finds the minimum total power needed. For the special case 
of DPC and SIC, the optimal coupling matrix or equivalently, the optimal encoding and/or decoding 
order of FOP and SPMP is partially solved in Section IIV-GI We first focus on centralized algorithms 
under total power constraints. Then we give a distributed implementation of the algorithm for SPMP 
under additional individual maximum power constraints. 

Although we focus on the sum power and white noise in this paper for simplicity, the results can be 
directly applied to a much larger class of problems with a single linear constraint ^S/ W/^ < 

Pt in FOP (or objective function X;f=i Tr (^Sl^W,^ in SPMP) and/or colored noise with covariance 
E wiwj = W;, which includes the weighted sum power minimization problem in QOl as a special 
case. Only variable changes = W^^Xl/W^^ and W^^H^ /W^ ^ are needed, where and 
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are positive definite for meaningful caseJl The single linear constraint appears in Lagrange functions 
for problems with multiple linear constraints |[23l . |[24l . and thus, the results in this paper serve as the 
basis to solve them ||25l . Special cases of multiple linear constraints include individual power constraints, 
per-antenna power constraints, interference constraints in cognitive radios, etc.. 

III. Preliminaries 

The algorithms are based on SINR duality, e.g., ifTOl . rate duality, and polite water-filling developed 
earlier [H. They are reviewed below. 

A. SINR Duality for MIMO B-MAC Networks 

The achievable rate region defined in Q can be achieved by a spatial multiplexing scheme as follows. 

Definition 2: The Decomposition of a MIMO Link into Multiple SISO Data Streams is defined as, for 
link / and Mi > Rank(5];), finding a precoding matrix = [^/pl^t/ 1, y/Pi^Mi^i,M,] satisfying 

Ml 

^l = T^l'rl=^Pl,mtl,mtl^, (9) 

m=l 

where t,^^ G C^^^xi is a transmit vector with \\ti^m\\ = 1; and p = ...,pi,A/i , ••■,Pl,m,,]'^ 
are the transmit powers. 

Note that the precoding matrix is not unique because = T^V with unitary V G £^m,xMi ^^^^ gives 
the same covariance matrix in Without loss of generality, we assume the intra-signal decoding order 
is that the stream is the to be decoded and cancelled. The receive vector for the m^^ stream of 
link I is obtained by the MMSE filtering as 

/Ml \-l 

ri,m = a«,m I X] ^l,lPl,i^lAl,i^l,l + 1 ^l,lhm, (10) 
\i=m+l / 

where ai^m is chosen such that ||r/ m|| = 1- This is referred to as MMSE-SIC receiver in this paper. 
For each stream, one can calculate its SINR. Let the collections of transmit and receive vectors be 

T = [k,m]m=l,...,Mi,l=l,...,L^ (11) 
R- = [^l,m\m=l,...,Mi,l=l,...,L- (1^) 



'For random channels, singular W; or W; will result in infinite power and rate with probability one. 
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The cross-talk matrix *(T,R) e ^^^i^'^^i^'^' between different streams |8| is a function of T,R, 

/ 1^ -^ \th / \ th 

and, assuming unit transmit power, the element of the ( X]i=i + ^ ) row and ( X]i=i Mi + nj 
column of is the interference power from the /c* Link's n"^ stream to the /"^ link's m"^ stream and is 
given by 



Then the SINR for the m*^ stream of Unk / is 



k,n 
l.m 







k = I and m > n, 
k = I, and m < n, 
otherwise. 



(13) 



(T,R,p) 



Pl,m rn^l,l^l,m 



L K'h 
k=ln=l 



(14) 



Such decomposition of data to streams with MMSE-SIC receiver is information lossless |26|, i.e., the 
sum-rate of all streams of link / is equal to the mutual information in 

In the reverse links, we can obtain SINRs using R as transmit vectors and T as receive vectors. The 
transmit powers are denoted as q = [qiA, ■■■,Qi,Mi, ■■■,Ql,i, ■■■,Ql,Ml]^- The intra-signal decoding order 
is the opposite to that of the forward link, i.e., the m* stream is the m"^ last to be decoded and cancelled. 
Then the SINR for the m^^ stream of reverse link / is 



(R,T,q) 



(15) 



k,n 



L Mk 
fc=ln=l 

For simplicity, we will use {T,R, p} ({R, T,q}) to denote the transmission and reception strategy 
described above in the forward (reverse) links. 

The achievable SINR regions of the forward and reverse links are the same. Define the achiev- 
able SINR regions 7^ {Pt) and T^t {Pt) as the set of all SINRs that can be achieved under the 
sum power constraint Pt in the forward and reverse links respectively. For a given set of SINR val- 



ues 7 







Yl\=i Mi + m\ diagonal element is 



m=l,...,Mi,l=l, 
th 



, define a diagonal matrix D(T,R,7°) G j^^^ a/, ^^^^^ 



We restate the SINR duality, e.g. [TOl, as follows. 



(16) 
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Lemma 1: If a set of SINRs 7" is achieved by the transmission and reception strategy {T, R, p} with 
IIpII^ = Pt in the forward Unks, then 7" is also achievable in the reverse links with {R, T,q}, where 
q satisfies = Pt and is given by 

q = (D-i(T,R,7°)-*^(T,R))-^l. 

And thus, one has 7i {Pt) = T^t {Pt)- 



(17) 



B. Rate Duality 

The rate duality of the forward and reverse links of the B-MAC networks is a simple consequence of the 
SINR duality 11]. The reverse link input covariance matrices are obtained by the following transformation. 

Definition 3: Let = J2m=iPi,rn^i,m^l ~ be a decomposition of Y^i-l- Compute the 

MMSE-SIC receive vectors R from (fTOb and the reverse transmit powers q from (ITtI) . The Covariance 
Transformation from 'Si-l to 'Si-l is 



M, 



^ qi,mri,mrl, 



1,...,L. 



(18) 



m=l 



We give the rate duality under a linear constraint X]i=i Tr ( W; j < Pt and/or colored noise with 



covariance E 



WlW 



W;. The covariance transformation for this case is also calculated from the 



MMSE receive beams and power allocation that makes SINRs of the forward and reverse links equal, as 
in ([TSl l. The only difference is that the identity noise covariance in Qi is replaced by and the all-one 



.W/t 



m=l,...,M,,l=l,...,L 



For convenience, let 



(19) 



vector 1 in (fTTl) is replaced by the vector j^tj^, 

denote a network where the channel matrices are [H; fc]; the input covariance matrices must satisfy the 
linear constraint X^^^ Tr ^XljW;^ < Pt', and the covariance matrix of the noise at the receiver of link 
/ is W;. Then the rate duality is restated in the theorem below. 
Theorem 1: The dual of the network ( fT9l ) is 



1=1 



(20) 



in the sense that 1) they have the same achievable rate region; 2) if 5]i;L achieves certain rates and 
satisfies the linear constraint in network ( fT9l ). its covariance transformation 5]i:L achieves better rates in 
network dlOll under the linear constraint X^Li Tr (S/ W/) = X^Li Tr (S/ W/) < Pt. 
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C. Polite Water-filling 

In m, we showed that the Pareto optimal input covariance matrices have a polite water-filling structure, 
which is defined below. It generalizes the well known optimal single user water-filling structure to 
networks. 

Definition 4: Given input covariance matrices obtain its covariance transformation Si;^ as in 

([TSl l. Let rij's and Cli's respectively be the corresponding interference -plus-noise covariance matrices. 
For each link /, pre- and post- whiten the channel H; ^ to produce an equivalent single user channel 
H/ = ^^"^Hi^iCl^ Define Q; = Cty^'Sitiy^ as the equivalent input covariance matrix of link 
/. The input covariance matrix Xl; is said to possess a polite water-filling structure if satisfies the 
structure of water-filUng over H/, i.e., 

Qi = Gi-DiG}, (21) 

where i^; > is called the polite water-filling level; the equivalent channel H^'s thin singular value 
decomposition (SVD) is = F;A;G| with € C^^'^^S G c^t^xA^,^ ^ R^'_^'^', and 
A*"; = Rank(H; ^). If all S;'s possess the polite water-filling structure, then Si;L is said to possess the 
polite water-filling structure. 

Theorem 2: The input covariance matrices I]i:L of a Pareto rate point of the achievable region and 
its covariance transformation "Ei-l possess the polite water-filling structure. 

The following theorem proved in [H states that XI; having the polite water-filling structure suffices 
for "Si to have the polite water-filling structure even at a non-Pareto rate point. 

Theorem 3: If one input covariance matrix XI; has the polite water-filling structure while other I]^, Xl^, 
k ^ I, are fixed, so does its covariance transformation X);, i.e., Q; = fi^^^^S^ri^^^ satisfies the structure 
of water-filling over the reverse equivalent channel Sj^ = Ct^ ^^^Hj^^fij Further more, S; can be 
expressed as 

±1 = VI (ai^ - (Ui^i^iHl + niY'^ , I = 1,...,L, (22) 
where ui is the polite water-filling level in (|2T]) . 

IV. Optimization Algorithms 

In this section, we present several related algorithms for the feasibility optimization problem (FOP) and 
the sum power minimization problem (SPMP) under rate constraints. SINR based and polite water-filling 
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Table I 
List of Algorithms 



Sec. 


Tab. 


Alg. 


Purpose 


IIV-BI 


El 


A 


SINR based, for EFOP 


llV-Bl 


m 


B 


SINR based, for ESPMP 


IIV-DI 




S 


Subroutine, for link i 


IIV-DI 


m 


I 


Improvement of A/B for iTree networks 


IIV-EI 


m 


w 


Subroutine, for water-filling level 


IIV-EI 


Nm 


PR 


Polite WF based, for iTree, SPMP 


IIV-EI 


Iviiil 


PRl 


Polite WF based, for B-MAC, SPMP 


IIV-FI 


m 


PRD 


Distributed version of PRl 


IIV-GI 


m 


O 


Enc./dec. order optimization, for FOP/SPMP 



based algorithms are designed. Algorithms for SINR version of FOP and SPMP have been designed 
in m, ifTOl . To take advantage of them, we show how to map a Pareto point of the achievable rate 
region to a Pareto point of the SINR region in Section IIV-AI and then use SINR based Algorithm A 
and B to solve FOP and SPMP respectively in Section IIV-BI The optimality of Algorithms A and B is 
studied in Section IIV-CI by examining the structure of the optimal solutions of FOP and SPMP. Then, 
for iTree networks defined later, Algorithm I is designed to improve the output of Algorithm A and B. 
The improvement and the optimal structure suggests that the rate constrained problems can be directly 
solved using Algorithm PR and PRl in Section ITV-El by polite water-filling, without resorting to the SINR 
based approach. In a network, it is desirable to have distributed algorithms, for which Algorithm PRD is 
designed in Section |TV-F[ Finally, we design Algorithm O to improve the encoding and decoding orders 
for all of the above algorithms when DPC and SIC are employed. For convenience, a list of algorithms 
in this paper is summarized in Table Jl 

A. Rate-SINR Conversion 

In order to find Pareto rate points of the achievable rate region by taking advantage of algorithms that 
finds Pareto points of the SINR region, one needs to find a mapping from a Pareto rate point to a Pareto 
SINR point. But multiple SINR points can correspond to the same rate and thus, multiple mappings exist. 
The following two theorems give an equal SINR mapping and an equal power mapping by choosing two 
decompositions of a MIMO link to multiple SISO data streams. Note that for the same total link rate. 
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different decompositions have different sets of SINRs of the streams and different number of streams. 
We show that equal SINR allocation or equal power allocation among the streams within a Mnk will not 
lose optimality. 

Theorem 4: For any input covariance matrices Si:L achieving a rate point [2^;];=^ i^, there exists 
a decomposition = = Y^m=iPi,rnU,mt\^^,l = l,...,L, with Mi > Rank(X;;), such that the 

corresponding transmission and MMSE-SIC reception strategy {T, R, p} achieves equal SINR for all 
streams of the same link, i.e., ji^m = e-^'/^^' — l,m = l,...,Mi,l = 1,...,L. Therefore uniform rate 
allocation over the streams of the same link will not lose optimality. 

The proof is given in appendix |A] and provides an algorithm to find the decomposition. An immediate 
consequence of Theorem |4] is a mapping of the Pareto boundary points of the achievable rate region to 
the SINR region. 

Corollary 1: Let Mi = Rank(Hi_i j^]. An SINR point [7;,^ = e^'/*^' - 1]^^^^ ^ is a Pareto 

boundary point in 7# {Pt), if and only if the rate point l ^ Pareto rate point in IZq, {Pt)- 

Therefore, the problem with rate constraints [X^*] ^ can be equivalently solved through the problem 
with SINR constraints [7;,^ = e^'/*^' - l] „=^...,m„z=i,...,l- 

The following theorem shows that uniform power allocation across the streams within a link will 
also not lose optimality, which is useful in designing algorithms for individual power constraints and/or 
distributed optimization |[27l . |[28l . 

Theorem 5: For any input covariance matrix XI, there exists a decomposition S = Ylm=i Vm^mXm 
such that the transmit power is uniformly allocated over the M streams, i.e., pm = Tr(5]) /M,\/m. 
Therefore uniform power allocation over the streams of the same link will not lose optimality. 

The proof is given in Appendix |B] 

B. SINR based Algorithms 

The results in Section ITV-AI serve as a bridge to solve the FOP or SPMP under rate constraints through 
the SINR optimization problems. First we show FOP is equivalent to the following SINR optimization 



^This will not lose optimality because by Theorem |2l the rank of the optimal input covariance matrix for link I is no more 
than the rank of H;,i. 
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problem in the sense of feasibility. 

EFOP : max min s.t. llpIL < Pr (23) 

{T,R,p} 7, 
^ ^ 1 < m < Ml " 

1<1<L 

where Mi = Rank (H; ;) is the number of streams of link I; = e^'^^^' — 1 is the target SINR for the 
streams of hnk 

Theorem 6: The optimum of FOP O is not less than 1 if and only if the optimum of EFOP ( [23] ) is 
not less than 1. 

Proof: If the optimum of EFOP is not less than 1, there exists a point [7;,™ > 7?]„_^ j^j ^ 
in 7* {Pt)- Then it follows from Corollary [T] that the rate point [X; = Af^log (1 + ^i^m) ^ -^l']/-! l 
lies in IZ^ {Pt), i-c, the optimum of FOP is not less than 1. The 'only if part can be proved similarly. 

■ 

Remark 1: If the target rates (Xj^, •••,X£) is not a Pareto point but feasible, the solution of EFOP may 
produce a Pareto rate point that is not a solution of the FOP. However, both will exceed the target rates. 
Similarly, SPMP is equivalent to the following SINR optimization problem 

^ l<m< Ml, 

ESPMP : min ||p||i , s.t. 7,,™ > 7?, (24) 
{T,R,p} 1 < / < L. 

Theorem 7: If |t, R, p| is an optimum of ESPMP (l24l ). the input covariance matrices 'Si-l produced 
by T and p must be an optimum of SPMP ([H). On the other hand, if Si;^ is an optimum of SPMP, 
there exists a decomposition leading to |t,R, p|, which is an optimum of ESPMP. 

Proof: If Si:/, produced by T and p is not an optimum of SPMP, there exists a solution S'^^.j^ such 
that [X|^]^_^ ^ can be achieved by a smaller total transmit power. Then it follows from Theorem |4] that 
there exists a decomposition of S^^.^ such that the target SINRs jf^ = e-^"/*^' — 1 are 

L ' J m=l,...,Mi,l=l,...,L 

achieved by the corresponding {T',R',p'} with ||p'||^ < ||p||i, which contradicts with the optimality 
of {t,R,p}. The reverse part can be proved similarly. ■ 

Remark 2: It is possible that neither SPMP nor ESPMP is feasible, i.e., the target rates or SINRs can 
not be achieved even with infinite power. In this paper, we only consider feasible problems. In practice, 
we can avoid solving the infeasible SPMP and ESPMP by solving the FOP with the maximum allowable 
sum power to test the feasibility first. 

By converting the FOP and SPMP to the simpler EFOP and ESPMP, these problems can be efficiently 
solved by the SINR duality based algorithms for MIMO beamforming networks lH, |[TOl . which is 
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summarized below. For fixed T, p, the optimal receive vector for each stream is decoupled and is given 
by the MMSE-SIC receiver in (fTOl) . The SINR duality in Lemma [T] implies that the coupled problem of 
optimizing the transmit vectors can be found by optimizing the receive vectors in the reverse links, i.e., 
for fixed R, q, the optimal receive vector for each stream in the reverse links is given by the MMSE-SIC 
receiver 



/m—l 



A, 



(25) 



. i=l 



where ili is obtained from Xl^ = Ylii=i^k.i'^k,i^\ ii ^ 7^ ^ using Q, and /3; „i is chosen such that 

II^Z,m,|| ~ 1- 

The optimization for p is different for the two problems. The algorithm designed in [81 for EFOP is 
described below. For fixed T, R, define the following extended coupling matrices lU 

D* Dl 



Dl 



■.T 



(26) 



(27) 



P'j' P'j' 

where and D are defined in (fT3l) and ([T6l ) respectively; and the SINR values in D are fixed as 
= 70,771 = l,...,Mi,l = where 7f's are the target SINRs in EFOP. With the optimal 

power p, all the scaled SINRs in (l23l) should be equal to the same value denoted as Cmax HI- Therefore 
p satisfies the equations 7^,^ = Cmaxli, fn = 1, Mi,l = 1, L and ||p|| = Pt, which together form 
the following eigensystem ||8l 

Tf'Pext = AmaxPext, (28) 

where Pext = [p"^, l]^ is the dominant eigenvector of T with its last component scaled to one; Amax = 
1 / C'max is the corresponding maximum eigenvalue. It was proved in |29l that for a non-negative matrix 
Y with the special structure (l26l ). the maximal eigenvalue and its associated eigenvector are strictly 
positive, and no other eigenvalue fulfills the positivity requirement. Therefore, the last component of Pext 
can always be scaled to one and the resulting p is a valid power vector. Similarly, in the reverse links, 
the optimal power q is obtained by solving the eigensystem below |[8l 

-'Vqext = Amaxqext; (29) 

where qext = [q^,l]"^ is the dominant eigenvector of A; and Amax is the corresponding maximum 
eigenvalue. 
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Table II 



Algorithm A (Solving EFOP) 



Choose p^°) > and T^^^ such that all streams have positive SINR. 
Set 71 ^ 0. 

While not converge do 

1. Update in the forward links 

a. Compute R("+i) from p(") and T^") using ([TOll; 

b. Compute A from T*") and R("+i); 

c. Solve eigensystem ( |29] l for q("+i); 

2. Update in the reverse links 

a. Compute t("+i) from q("+i) and R("+i) using dB); 

b. Compute Y from t("+i) and R("+i); 

c. Solve eigensystem (l28T l for p("+i); 
n n + 1; 

End 



For ESPMP, we use the algorithm in lITOl . Let and 7}^ respectively be the SINR for the 
stream of the forward and reverse link / after the update. To satisfy the SINR constraints, the standard 



power control is used to update p and q iteratively, where in each iteration, the power of the stream with 



over-satisfied (unsatisfied) SINR is reduced (increased): 




(30) 



,(n+l) ^ 7; (n) 



(31) 





The algorithms to solve EFOP and ESPMP are summarized in table inland table |lll] respectively. After 



obtaining T, R and p, the corresponding input covariance matrices for FOP and SPMP can be easily 



obtained. 



The convergence of these algorithms are proved in IH, ifTOl . It can be verified that the objective 
function in EFOP (|23]| is monotonically increased by Algorithm A |i8J. In Algorithm B, once the solution 
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Table III 

Algorithm B (Solving ESPMP) 

Choose p^°) > and T^^^ such that all streams have positive SINR. 
Set 71 ^ 0. 

While not converge do 

1. Update in the forward links 

a. Compute R("+i) from p(") and T^") using OOll; 

b. Update power in the forward links p("+i) = Ip (p'"-*); 

c. Let |7;",„ = ji.rn (T("),R("+i),p("+i)) ,for allrn,?}; 

d. Compute D and * from T("), R("+i) and |7,°„,|; 

e. Compute q^") = {T>-^ - *^)"^ 1; 

2. Update in the reverse links 

a. Compute t("+i) from q(") and R("+i' using (|25]i; 

b. Update power in the forward links q("+i) = I^, (^q(")); 

c. Let |7;"„^ ^ 7i^,„ (T("+i',R("+i),q("+i)) ,for allm,/|; 

d. Compute D and * from t("+i), R("+i) and |7,°,„|; 

e. Compute p("+i) = (D^i - *)"^ 1; 
n 71 + 1; 

End 



becomes feasible, i.e., all SINR values meet or exceed the minimum requirements, it generates a sequence 
of feasible solutions with monotonically decreasing sum power ifTOl . The optimality of these algorithms 
will be discussed in the next section. 

C. Optimality Analysis for SINR based Algorithms 

Algorithm A or B can find good solutions but may not find the optimum for general B-MAC networks, 
according to the numeric examples in Section |Vl But we can still obtain insight of the problem and derive 
improved algorithms by finding the necessary conditions satisfied by the optimum. 

To avoid deriving the necessary conditions with the non-differentiable objective function of FOP Q, 
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we rewrite it into the following equivalent problem 




(32) 



s.t. 



If - If 



yi / 1 



L 



^ 0, / = 1, • • • , -^^ and ^ Tr (S,) < Pt- 



1=1 



Then the following theorem holds. 

Theorem 8: Necessity: If Si-^ = ( Si, S^, ) is an optimum of FOPa (US) or SPMP dH), it must 



satisfy the optimality conditions below: 

1) It possesses the polite water-filling structure as in Definition |4l 

2) The achieved rates must satisfy Ii ('Si-l,^] = alf, I = 1, L, where for FOPa, a > is some 



On the other hand, if certain Si:^ satisfies the above optimality conditions for FOPa or SPMP, it must 
satisfy the Karush-Kuhn-Tucker (KKT) conditions of FOPa or SPMP, and thus achieves a stationary 
point. 

Sufficiency: If certain Si:^, satisfies the above optimality conditions for FOPa or SPMP and if the 



levels of in ^T^, then is the optimum of FOPa or SPMP. 

It can be proved by contradiction that the optimums of FOPa and SPMP are Pareto optimal. By 
Theorem |2j they possess the polite water-filling structure. The second optimality condition can be proved 
by a proof similar to that of Lemma 1 in ifTOl for ESPMP. The third optimality condition can also be 
proved by the contradiction that if the total transmit power is less that Pt, the extra power can be used to 
improve the rates of all links simultaneously. The connection between the necessary optimality conditions 
and the KKT conditions, and the sufficiency part are proved in appendix O 

We check whether the solutions of Algorithms A and B satisfy the optimality conditions. We use the 
notation "for the variables corresponding to the solution of Algorithm A or B. The following is obvious. 

Lemma 2: After the convergence of the Algorithm A or B, the following conditions are satisfied. 

1) In the forward (reverse) links, the MMSE-SIC receive vectors corresponding to T and p (R and 
q) are given by R (T). The set of SINRs achieved by {T,R, p} in the forward links equals to 
that achieved by {R, T, q} in the reverse links. 




weighted sum rate ^i^i (^i;Li ^) is a concave function of T^i-l, where P^'s are the polite water-filling 
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2) For Algorithm B, the achieved rates satisfy Xi (Si:L, = , I = 1, ■■■,L. 

3) For Algorithm A, satisfies Ya=i Tr (S/) = Pt- 

Note that the rates achieved by Algorithm A may not satisfy the condition Ti {Tii-l, ^) = aT^^l = 
1, L. In order to discuss the optimality, we modify the target rates in FOP/FOPa to = Xi (^v.l, 
Then, we can claim that the solution of Algorithm A also satisfies the second optimality condition. 

Furthermore, if one can prove that the solution possesses the polite water-filling structure, then 
satisfies all optimality conditions in Theorem [8] One might conjecture that the first condition on 
MMSE structure in Lemma |2] implies the polite water-filling structure. Unfortunately, this is not always 
true according to the following counter example. Consider a single user channel H with Rank (H) > 1 
and unequal non-zero singular values. If the transmit vectors are initialized as the non-zero right singular 
vectors of H, the algorithm will converge to a solution where the transmit and receive vectors respectively 
are the non-zero right and left singular vectors of H, and the transmit powers will make the SINRs of 
all streams the same. Then the solution does not satisfy the single-user water-filling structure. However, 
for a smaller class of channels, we have the following. 

Theorem 9: If Rank(H; ;) = 1,V/, the solution of Algorithm A (B) Si;^ satisfies all the optimality 
conditions in Theorem [H and thus achieves a stationary point. 

Proof: The polite water-filling structure oiYli-L can be proved by considering the transmission over 
the equivalent channel H/ = fi^ ^^^H; Decompose the forward and reverse equivalent input 
covariance matrices = and = fi^^^^Sjfil^^ to beams as = X]m=i ^',m^',m^Jm' 



where dirn = PL; 



2 



is the equivalent transmit power and u/ m = y/pi^m^i^m/ \J di^m 

~ jyr ~ ± ~ 1/2^ 

is the equivalent transmit vector; and Q; = Y,m=i di,m^i,m'^l ^, where di^^n = m,-. " ' 



and 



V; m = d]^"^ ^qi^m'^i^m/ \J ^i^m- The algorithm sets the number of data streams as Mi = Rank(H; 
1, V/, which does not lose optimality by Theorem |2] Since the interference-plus-noise is whitened in the 
equivalent channel H;, the MMSE receiver i reduces to the matched filter, i.e., v/ 1 = a; iH/u; i. 
Similarly, the MMSE receiver u; i in the reverse equivalent channel tlj is given by the matched filter: 
= A,iH|vi 1 = a/ 1/3; iHIh^u; 1, i.e., i is an eigenvector of Sj^H^. Since the initial point is 
chosen such that the SINRs of all streams are strictly positive, they must also be strictly positive after 
the convergence. Hence, u; i must be the eigenvector corresponding to the only non-zero eigenvalue 
6i^i of hJH;. Then = di^iui^iuj^^ = ui^i {vi - l/5i^i)+uj"p where Di = di^i + l/6i^i is the polite 
water-filling level. Therefore, the solution 'Tii-l satisfies the polite water-filling structure and all other 
optimality conditions by Lemma |2] ■ 
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Transmitters Receivers 




Encoding/Decoding Order A Encoding/Decoding Order B 

Figure 2. A network with topology loop and encoding/decoding order A is an iTree network, whose interference graph does 
not have any directional loop. With encoding/decoding order B, it is not an iTree network because the interference graph has 
directional loops. 



For general cases, Algorithms A and B may converge to a solution where some Sl^'s do not 

possess the polite water-filling structure. Then the rates of these links may be improved without hurting 
other links by enforcing the polite water-filling structure on these S^'s . In the next sub-section, we show 
how to do it by improving the algorithms for a sub-class of B-MAC networks named iTree Networks. 

D. Improved Algorithm for iTree Network?, 

iTree networks defined in HI appears to be a natural extension of MAC and BC. We review its definition 
below. 

Definition 5: A B-MAC network with a fixed coupling matrix is called an Interference Tree (iTree) 
Network if after interference cancellation, the links can be indexed such that any link is not interfered 
by the links with smaller indices. 

Definition 6: In an Interference Graph, each node represents a link. A directional edge from node i 
to node j means that link i causes interference to link j. 

Remark 3: The iTree network is related to but different from the network with tree topology, which 
implies iTree network only if the interference cancellation order is chosen properly. For example, a MAC 
which has tree topology is not an iTree network if the successive decoding is not employed at the receiver. 
On the other hand, even if there are loops in a network, it may be an iTree network if the interference 
cancellation order is right. We give such an example in Fig. |2] where there are four desired data links 
1, 2, 3, and 4, and dirty paper coding and successive decoding and cancellation are employed. With 
encoding/decoding order A, where the signal X2 is decoded after xi and the signal X3 is encoded after 
X2, each link I G {2, 3, 4} is not interfered by the first I — 1 links. Therefore, the network in Fig. |2]is an 
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iTree network even though it has a loop of nonzero channel gains. However, for encoding/decoding order 
B, SIC is not employed at R1/R2, and X2 is encoded after X3 at T2/T3. The network in Fig. |2]is no 
longer an iTree network because the interference graph has directional loops, making the iTree indexing 
impossible. 

Since the coupling matrix of the reverse links is the transpose of that of the forward links, the 
interference relation is reversed as stated in the following lemmas. Without loss of generality, we consider 
iTree networks where the l^^ link is not interfered by the first I — 1 links in this paper. 

Lemma 3: lU If in an iTree network, the l^^ link is not interfered by the links with lower indices, in 
the reverse links, the l^^ link is not interfered by the links with higher indices. 

We develop an algorithm to improve the performance of iTree networks. That is after the convergence 
of the Algorithm A or B, if any output of the algorithm Sj does not satisfy the polite water-filling 
structure, the objective (cost) in FOP (SPMP) can be strictly increased (decreased) by enforcing this 
structure at Unk i. 

We first define some notations and give a useful lemma. The output of the SINR based algorithm 
Tii-L achieves a rate point ^ with sum power Pt = (^O^ "^^^ algorithm also produces 

the corresponding covariance transformation ^i;^ computed from R and q achieving a set of rates 
X/ =Ti,l = 1, L. Fixing the input covariance matrices Sj, j = i + 1, L for the last L — i links, the 
first i links form a sub-network 

[[^iMk,i=i,...,i . E (S/) = B^, m]i=i,...)j , (33) 

where W/ = I + Xlj^i+i covariance matrix of the equivalent colored noise; 

= X]I=iTr (S/). By Theorem [U the dual sub-network is 

,^Tr(s,W,)=P^,[I,U_J. (34) 
1=1 J 

It is clear that after convergence, l^iii, = ^l^i, ll/,^ is the covariance transformation of T1\;L = 

(Si, Si). By Lemma 9 in [Jj, Si^j = ^Si, Sj^ is also the covariance transformation of fli-i = 

(Si,...,Sj), applied to the sub-network ( [331 ). 

The algorithm to improve the performance contains three steps. 

Step 1: Improve the rate of reverse link i by enforcing the polite water-filling structure on 51 j. By 
Lemma|3l the reverse link i causes no interference to the first i—1 reverse Unks. If we fix 5]^, / = 1, i— 1, 
the rate of reverse link i can be improved without hurting other reverse links in the sub network by solving 
the following single-user optimization problem: 
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max log 



I + Hl.S,H,,f2ri| 

i-l 

5.t. Tr (S,W,) < - J] Tr (±iWi) = Tr (^,W,) , 



(35) 



1=1 



where Cli = I + Zl=\ *fc,^H^ .EfeHfc,^ and = I + Zf=i+i **jHij^iH| j = fi^. By a simple 
extension of the solution with white noise and sum power constraint in |[30l to case of colored noise and 
linear constraint here, it can be proved that the optimal solution is uniquely given by the following polite 



water-filling procedure. Perform the thin SVD CI 



^1/2 



H, , n 



-1/2 



FiAjGj. Let Ni = Rank(Hi,i) and 



5ij be the j^^ diagonal element of Af. Obtain Dj as 



D, 



diag [dij, ...,di^Ni 
1 



(36) 



j = l,...,iVi, 



where f « is chosen such that Xlfci ^ij = Tr ( 'tlifli ) and can be obtained by conventional water-filling 



(37) 



algorithm. Then the optimal solution is given by 



nri/2F.D.Ftj^-i/2 



By Theorem [3l if Slj does not satisfy the polite water-filling, nor does 5],, which implies that Sj is not 
the optimal solution and "S- achieves a rate I- > %. 

Step 2: Improve the forward links by the covariance transformation from S'^.^ = ^Si, Si_i, 't][^ 
to "E^.- = {^i, for the sub-network. Due to the special interference structure of iTree networks, 
the calculation of the transmit powers of the covariance transformation can be simplified to be calculated 
one by one as follows. When calculating p/^m> the transmit powers Pk,m '■ = k = 

I + and pz_„, n = m + l,...,Mi have been calculated. Therefore, we can calculate Xl^ = 



m=l Pk,m^k,m^if 



fc,m' 



/ + and obtain the interference-plus-noise covariance matrix of the 



link / as rij = I + Ylk=i+i *«,fcHz,fcS;),H[^,. Then obtain pi^rn as 



Pl,r. 



/to' _i_ Y^A^i 



(38) 



Finally is given by 



m=l 



t 



(39) 
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Table IV 

Algorithm S (Improving the Rate of Link i for iTree Networks) 

1. Obtain tli-i from the solution of Algorithm A or B. 

2. Solve for the optimal in the optimization problem ([35]) 
by polite water-filling. 

3. Calculate the input covariance matrices S^, Z = 1, i for the first 
i links by the covariance transformation as in ( |39] l. 

4. Output the updated input covariance matrices 

By Theorem [T] the covariance transformation T,-^.- achieves a set of rates X- > X- > 2j and > 
i'l = < i in the sub-network under the sum power constraint J2]=i (^j) = X]i=i ^X1;W;^ + 
Tr ^S-Wi^ = P^. Noting that the first i links cause no interference to all other links in the original 
network, the input covariance matrices Xl^^.^ = (S'^, • • • , Sj+i, • • • , S^) must achieve a rate point 
Xj- > Xj and X^' > X;,V/ 7^ i in the original network with the same sum power Px- 
We refer to the above algorithm as Algorithm S and summarize it in table |lVl 

The performance can be strictly improved using the output of Algorithm S. For FOP, we first reduce 
the transmit power of link i until its rate is reduced to Xj. This may benefit other links as well because the 
interference to other links is also reduced. Then this extra power can be used to simultaneously increase 
all link's power by the same factor, and thus improve the rates of all links and the objective function of 
FOP. The cost function of SPMP can be strictly decreased by reducing the transmit power of link i such 
that the rate is reduced to X?. Note that the above operations can be automatically achieved by Algorithm 
A (B) using "^'i-i as the initial point. According to the above, we propose an improved algorithm for 
iTree networks using Algorithm S as a component. It is referred to as Algorithm I and summarized in 
table lYl The optimality of Algorithm I is stated in the theorem below. 

Theorem 10: For iTree networks, the solution of Algorithm I satisfies all the optimality conditions in 
Theorem [8j and thus achieves a stationary point. 

Proof: For iTree networks, the SPMP is always feasible. Step 2 of the Algorithm I generates a 
feasible solution Si;L. Then in step 3, both Algorithm S and Algorithm A (B) will monotonically 
increase (decrease) the objective (cost) in FOP (SPMP). Since the objective (cost) is upper bounded 
(lower bounded). Algorithm I must converge to a fixed point. If the optimality conditions in Theorem [8] 
is not satisfied. Algorithm I will strictly increase (decrease) the objective (cost) in FOP (SPMP), which 
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Table V 

Algorithm I (Improved Algorithm for iTree Networks) 



1 . Generate random initial point such that all streams have 
positive SINR. 

2. Run Algorithm A or B to obtain the solution 'Ei-.l- 

3. Repeat 

For i = 1 : L 

If Si does not satisfy the polite water-filling structure 
Obtain S^^.^ from using Algorithm S. 
Run Algorithm A or B with Sj^.^ as the initial point to 
obtain the solution 
End 
End 
Until converge 

contradicts with the assumption of fixed point. ■ 
Actually, in almost all simulations we conducted for general B-MAC networks, Algorithm A or B with 
initial point randomly generated from a continuous space is observed to converge to a stationary point. 
Therefore, Algorithm I seldomly runs Algorithm S. Therefore, Algorithm S is more of theoretic value 
for the convergence to a stationary point and serve as a basis for the algorithm in the next subsection. 

E. Polite Water-filling based Algorithms for SPMP 

1) Algorithm PR for iTree Networks: In stead of converting the rate constrained problem to the SINR 
constrained problem, we can modify Algorithm S to directly solve SPMP for iTree networks. In (l36l ). 
the polite water-filling level Ui is chosen such that the sum power is unchanged when switching to the 
forward links. Because this polite water-filling level will improve the rate of reverse link i, if the initial 
solution is feasible, i.e., the rate of reverse link i is no less than I^, we can reduce the polite water-filling 
level Vi to make the rate of reverse link i equal to Z^, and thus reduce the sum power when switching 
to the forward links. This results in an algorithm which monotonically decreases the sum power once 
the solution becomes feasible. A simple algorithm in Table |Vl] referred to as Algorithm W can be used 
to calculate the polite water-filling level Ui to satisfy the rate constraint if. 

This modification of Algorithm S is referred to as Algorithm PR and is summarized in table IVIII where 
P stands for Polite and R stands for Rate constraint. It can be shown that once Algorithm PR finds a 
feasible solution, it will monotonically decrease the sum power until it converges to a stationary point. 
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Table VI 

Algorithm W (SOLVING THE Polite Water-filling Level for the Rate Constraints) 

1. Initialize the set of indices of the streams of Hnk i as 
r = {!,..., iVj, where A^, = Rank(H,,0. 

2. Calculate i^i = I e = /IljgrOi.j ) , which is the solution of 

E,erlog(l + (i^.-l/4,)4,)=l0. 
Obtain dij = — l/Sij for j £ T. 

3. If > 0, yj e r, stop. Otherwise, for all j e T, if dij < 0, 
fix it as di,j — 0, delete j from F. Repeat step 2). 

Table VII 

Algorithm PR (Solving SPMP for iTree Networks) 



Initialize such that y 0,Vi. 
While not converge do 
For i = 1 : i 

1. Calculate Si ^ by the covariance transformation of Si:^ 
applied to the sub-network. 

2. Obtain by polite water-filling as in (l36T l and (|37] |. where 
the polite water-filling level I'i is calculated by Algorithm W. 

3. Calculate Sj^.j by the covariance transformation of 

Sj^.j = I^Si, applied to the sub-network. 

4. Update as Si:l = (s'l, S;, S^). 
End 

End 



2 ) Algorithm PRl for B-MAC Networks: To get rid of the covariance transformation which is more 
complex than the polite water-filling as will be discussed later, and to make the algorithm work for 
general B-MAC networks, we obtain an intuitive algorithm by imposing the polite water-filling structure 
iteratively. It is referred to as Algorithm PRl and is summarized in table IVIIII It turns out that the 
algorithm can also be derived from the Lagrange function and KKT conditions of the problem, where the 
Lagrange multipliers are exactly the water-filling levels of the links. Adjusting the Lagrange multipliers to 
satisfy the rate constraints is exactly what Algorithm W does. It is clear that if the algorithm converges, 
the solution of Algorithm PRl satisfies the optimality conditions in Theorem [8l and thus achieves a 
stationary point. 
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Table VIII 

Algorithm PRl (Solving SPMP for B-MAC Networks) 

Initialize and fJ^'s such that ^ 0,Vi and = I,Vz. 
While not converge do 

1. Update in the forward Unks 

a. For Vi, obtain tti from using 

Perform thin SVD rjri^^H,,;!!^^''^ = FiA.Gj. 

b. Obtain by the water-filHng in ( [36b , where 

the poUte water-filling level is calculated by Algorithm W. 

c. Update S^'s as 

= f2-i/'GiD,G|f27'/',Vi. 

2. Update in the reverse links 

a. For Vz, obtain fJj from using Q. 
Perform thin SVD rir^/^jj^ .$7-1/2 ^ y.A.gI 

b. Obtain D; by the water-filling in ( l36b , where 

the polite water-filling level i/; is calculated by Algorithm W. 

c. Update S/s as 

End 



It is difficult to prove the convergence of Algorithm PRl. But the intuition and all simulations we 
conducted strongly indicate fast convergence. 

Remark 4: Algorithm PRl can be used to solve the FOP by replacing constraints with and 
searching for a to satisfy the power constraint. 

Remark 5: An advantage of Algorithm PRl is that it can be easily implemented distributedly as will 
be shown in Section IIV-FI Another advantage is that it has linear complexity in each iteration, because 
the SVD for polite water-filling is performed over the matrices whose dimensions are not increased with 
the number of desired data links L. However, the complexity order of Algorithm B depends on L. In 
each iteration, the complexity of calculating all the MMSE-SIC receive vectors is still linear with respect 
to L. But to calculate the transmit powers p and q, we need to solve two J2i=i Afrdimensional linear 
equations, whose complexity depends on the density and structure of the cross-talk matrix ^' (T,R). In 
the worst case, the complexity order is O {L^)- In other cases such as with triangular or sparse ^' (T, R), 
the complexity is much lower. Fortunately, in practice, ^' (T, R) is usually sparse for a large wireless 
network because of path loss. 
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F. Distributed Implementation of Algorithm PRl 

In a network, it is desirable to use distributed optimization. The above centralized algorithms serve 
as the basis for distributed design. Here, we design a distributed algorithm based on Algorithm PRl for 
time division duplex (TDD) networks.. To perform the polite water-filling in Algorithm PRl, Ti (Ri) only 

— 1/2 ^ —1/2 ^ I —1/2 ^ 1/2 

needs to know the equivalent channel ft^ Hi iftj^ , where both fli (fli) and H| (H/ /fi^ ) 

can be obtained by pilot-aided estimation in the reverse (forward) Links in time division duplex (TDD) 
networks. In frequency division duplex (FDD) networks, the equivalent channel needs to be calculated 
from feedback. Thus, TDD system has an advantage. 

We assume block fading channel, where each block consists of a training stage followed by a transmis- 
sion stage. The training stage is further divided into several rounds, where one round consists of a half 

~ —1/2 

round of pilot aided estimation of and fli in the forward link and a half round of pilot aided 

t —1/2 ^ 

estimation of H| ^fi^ and fi; in the reverse link. Since Algorithm PRl achieves most of the benefit in 
very few iterations, the required number of training rounds is small and can be as less as 2.5 rounds. In 
the training stage, the Tfs and iij's run a distributed version of Algorithm PRl to solve SPMP. At the 
end of the training stage, T;'s use the latest "Ei's as the input covariance matrices for the transmission 
stage. 

First, we describe the operation at each node. 
Operation at Tf. 

• In the {i — 1)* reverse training round, T/ estimates the interference-plus-noise covariance matrix 
r2[*~^^ and the effective channel ^ri[* 

• At the beginning of the forward training round, T/ calculates the input covariance matrix 

by polite water-filling over the equivalent channel ^ri[* ^■'^ ^ (^Clf" as in step 1 of 

Algorithm PRl. However, in practice, the transmit power of T; can not exceed certain maximum 
value denoted by Pl?^". If Tr ( > P^'^", decrease the polite water-filling level vi obtained by 



Algorithm W until Tr y^fj = P™^^. Then Ti transmits pilot signals in the forward training 
round. 
Operation at Ri: 

• In the forward training round, Ri estimates fl^*^ and H^^^ ^fij* 

• At the beginning of the i^^ reverse training round, Ri calculates by polite water-filling over 
the equivalent channel (^J^f ^^''^ (^fif^^^^ as in step 2 of Algorithm PRl. Denote P^^'' as 
the maximum transmit power of Ri. If Tr ( > P^^'^, decrease the polite water-filling level ui 
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Table IX 

Algorithm PRD (Distributed Version of Algorithm PRl) 



Initialize i = 1 and n|°' = I, f2|°' = I, V/. 

1. In the i* forward training round, T; calculates s|*^ by polite 

water-filling over ( fi; 1 H/ ; ( $7; 1 and transmits 
pilot signals. (In the round, since the equivalent channel is 
unknown, can be chosen randomly.) 



Ri estimates and H,,, {Ctf 



-1/2 



2. In the reverse training round, Ri calculates by polite 
water-fiUing over I (j^l*'') and transmits 



pilot signals. T/ estimates and h|j 



-1/2 



3. Let i — i + I and enter the next round. Keep updating s|'' and 
until the end of the training stage. 



obtained by Algorithm W until Tr j = P^^'^. Then Ri transmits pilot signals in the i reverse 
training round. 

Then, we summarize this distributed algorithm for calculating the input covariance matrices in Table 
HXl and refer to it as Algorithm PRD. 

Note that one training round is almost the same as one iteration in Algorithm PRl except that the 
transmit power of each node is constrained not to exceed a maximum value. Therefore, after convergence, 
if the transmit power of each node does not exceed its maximum transmit power. Algorithm PRD achieves 
nearly the same performance as Algorithm PRl. However in practice, it is desirable to allocate as less 
number of training rounds as possible. Then, Algorithm PRD may not be able to fully converge and thus 
the target rates may not be satisfied. Fortunately, it is observed in the simulations that Algorithm PRD 
converges very fast at the first few rounds. 

G. Optimization of the Encoding and Decoding Order 

For the special case of using DPC and SIC to cancel interference, the coupling matrix $ (vr) is a function 
of the encoding and decoding order vr. In this section, we partially characterize the optimal encoding and 
decoding order vr and design an algorithm to find a good vr. It is in general a difficult problem because 
the encoding and decoding orders at the BC transmitters and the MAC receivers need to be solved jointly. 
However, for each Pseudo BCIPseudo MAC defined below, the optimal vr is characterized in Theorem 
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Definition 7: In a B-MAC network, a sub-network with a single physical transmitter and a set of 
associated links whose indices forms a set £b is said to be a Pseudo BC if either all links in 
completely interfere with a link k or all links in Cb do not interfere with a link k, \/k G £g , i.e., the 
columns with indices in Cb of the coupling matrix excluding rows in Cb, are the same. A sub-network 
with a single physical receiver and a set of associated links, whose indices forms a set Cm, is said to be 
a Pseudo MAC if either all links in Cm are completely interfered by a link k or all links in Cm are not 
interfered by a link k, \/k G C^, i.e., the rows with indices in Cm of the coupling matrix excluding 
columns in Cm, are the same. 

Example 1: In Fig. [T] suppose xi is encoded after X2 and X4 is the last one to be decoded at the 
second physical receiver. Then link 2 and link 3 forms a pseudo MAC because they belong to the same 
physical receiver and suffer the same interference from xi, X4 and X5. Similarly, link 4 and link 5 forms 
a pseudo BC. 

Remark 6: The pseudo BC and pseudo MAC were first introduced in fT\ where the optimal encod- 
ing/decoding order of each pseudo BC/pseudo MAC for the weighted sum-rate maximization problem 
(WSRMP) is shown to be consistent with the optimal order of an individual BC or MAC. This is because 
a Pseudo BC or a Pseudo MAC can be isolated from the B-MAC network to form an individual BC or 
MAC. Similar results can also be obtained for FOP and SPMP as shown below. 

First, we need to modify the FOP and SPMP to include encoding and decoding order optimization 
and time sharing. Let H be a set of valid coupling matrices produced by proper encoding and decoding 
orders of a B-MAC network. Define a larger achievable region 



7^(PT) = Convex Closure |J 7^# (Pt) ■ 



The modified optimization problems are 



OFOP: 



max 

rG7^(P^) 




(40) 



and 



OSPMP: min Pt 



(41) 



s.t. m=i,...,L e nPr). 
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The following lemma is a consequence of that all outer boundary points of TZ{Pt) are Pareto optimal 
and can be proved by contradiction. 

Lemma 4: The optimal solution of OFOP or OSPMP is the intersection of the ray a [Xj', • • • , a > 
0, and the boundary of 1Z{Pt), where for OSPMP, the sum power Pt is chosen such that the intersection 
is at [jO,-- - ,X0]. 

The following theorem characterizes the optimal encoding and decoding order for those boundary 
points of TZ{Pt) that can be achieved without time sharing and by DPC and SIC. The sufficiency part 
only holds for MAC and BC. 

Theorem 11: Necessity: If the input covariance matrices f^i.L and the encoding and decoding order 
TT for a valid coupling matrix achieves the optimum of OFOP and OSPMP without time sharing, they 
must satisfy the following necessary conditions: 

1) satisfies the optimality conditions in Theorem [H 

2) If there exists a pseudo BC (pseudo MAC) in the B-MAC network, its optimal encoding (decoding) 
order fr satisfies that, the signal of the link with the largest (smallest) polite water-filling level 
is the 77,* one to be encoded (decoded). 

Sufficiency: In MAC or BC, if certain Tiix and tt satisfy the above conditions, then f^i.L and tt achieves 
the global optimum of OFOP or OSPMP. 

Proof: The first necessary condition follows from Theorem [H Then the second necessary condition 
follows from the following two facts and Lemma IH 1) Any outer boundary point of 1Z{Pt) must be the 
solution of a WSRMP with certain weight vector [wi] ^. It is proved in |[T1 that the optimal input 
covariance matrices maximizing the weighted sum-rate must satisfy the polite water-filling structure and 
the polite water-filling levels are given by vwi?, for some constant > 0; 2) By Theorem 9 in HI, 
the weighted sum-rate optimal encoding and decoding order of each Pseudo BC (Pseudo MAC) is that 
the signal of the link with the 72* largest (smallest) weight is the 77* one to be encoded (decoded). For 
the sufficiency part, suppose certain f^i;L and tt satisfy the two conditions in Theorem [TT] For MAC, 
satisfying the second condition implies that the weighted sum rate vili $ (tt)) is a concave 

function of Si:j^, where t'/'s are the polite water-filling levels of f^i;L- Then from the proof for the 
sufficiency part of Theorem[8l 'Tii.l and vr maximizes the weighted sum-rate Yld ^i^i (^i:L) * ('^)) and 
thus achieves a boundary point of the capacity region of MAC. By Lemma IH they achieve the global 
optimality of OFOP or OSPMP. The sufficiency part for BC follows from the rate duality. ■ 
The proof of Theorem [TT] suggests an algorithm to improve the encoding/decoding order for each 
pseudo BC and pseudo MAC by simply updating the encoding and decoding order according to current 
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Table X 

Algorithm O (Improving the Encoding and Decoding Order) 

Initialize the encoding and decoding order tt such that $ (tt) is valid. 

1. Solve the FOP or SPMP with fixed * (tt). 

2. Calculate the polite water-filfing levels from the solution of FOP or 
SPMP obtained in step 1. 

3. For each Pseudo BC and Pseudo MAC with the polite water-filling 
levels obtained in step 2, if tt satisfies Theorem [TT] output tt and 
stop. Otherwise, set tt to satisfy Theorem [TT| and return to step 1 . 

polite water-filling levels. We refer to it as Algorithm O and summarize it in Table |Xl 

Remark 7: For the special cases of MAC and BC, if Algorithm O converges, the solution gives the 
optimal order by Theorem [TT] 

Remark 8: Optimal and sub-optimal algorithms for the special case of solving OSPMP for SIMO 
MAC/MISO BC have been proposed in |[T6l . The optimal algorithm is much more complex than Algorithm 
O as it involves several inner and outer iterations. The difference between Algorithm O and the sub-optimal 
algorithm are as follows. 1) The sub-optimal algorithm works for SIMO MAC, avoiding the calculation 
of beamforming matrices, while Algorithm O works for MIMO cases; 2) To find the encoding/decoding 
order, after obtain the optimal solution of SPMP with fixed vr in step 1 , the sub-optimal algorithm in (T61 
needs to solve an equation to obtain a weight vector [wi] such that [if, ■ ■ ■ ,X£] is the optimal 

solution of the WSRMP with the weights wi's and fixed vr, while in Algorithm O, the weight vector is 
directly given by the polite water-filling levels. Same as the algorithm in |[T6l . it is possible for Algorithm 
O to cycle through a finite number of orders. In this case, we can simply choose the best one from the 
finite number of orders. For MAC and BC, it is observed that the reason of non-convergence is that the 
corresponding boundary point can not be achieved without time-sharing. 

Remark 9: Algorithms to solve OFOP and OSPMP for MIMO BC have been proposed in IHl where 
the problems are converted to the weighted sum-rate maximization problem. The algorithms need to 
repeatedly solve a weighted sum-rate maximization problem for weight vectors searched by the ellipsoid 
algorithm. The weighted sum-rate maximization problem is solved by the steepest ascent algorithm. 
For OSPMP, additional search for the total power is also needed. In contrast. Algorithm O solves the 
problems directly in one sequence of iterations, replaces the steepest ascent algorithm with SINR based 
algorithms or polite water-filling, only searches weight vector suggested by the pohte water-filUng level. 
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resulting in much lower complexity. 

Remark 10: For BC and MAC, we can also use a simpler encoding/decoding order in [31] which is 
shown to be asymptotically optimal for MAC (and thus also for BC by duality) when the target rate of 
each user is the same and the number of users is large. At each transmitter (receiver), the signal of the 
link with its channel matrix having the n* smallest (largest) dominant singular value is the n* one to be 
encoded (decoded). For convenience, we refer to this order as MEB (maximum eigenmode beamforming) 
order. It can be shown that MEB order is optimal for SISO MAC and BC. 

V. Simulation Results 

Simulations are used to verify the performance of the proposed algorithms. Let each transmitter and 
receiver are equipped with Lt and Lr antennas respectively. DPC and SIC are employed for interference 
cancellation. Block fading channel is assumed and the channel matrices are independently generated 
by H/ fc = y/^;^H|^^, VA;, /, where H|^^ has zero-mean i.i.d. Gaussian entries with unit variance; and 
gi^k^kj is a positive number and the value is set as OdB except for Fig. [5] and Fig. [6l In Fig. UMl e,ach 
simulation is averaged over 100 random channel realizations, while in other figures, a single channel 
realization is considered. In all simulations we conducted. Algorithm A, B, PR and PRl are observed 
to converge to a stationary point of the corresponding problem. We call pseudo global optimum the 
best solution among many solutions obtained by running the algorithm for many times with different 
initial points and with the encoding/decoding order obtained by Algorithm O. For the plots with iteration 
numbers, we show rates or power after x.b iterations/rounds, where the last 0.5 iteration/round is the 
forward link update. 

Algorithm A can be used to find the achievable rate region boundary by varying the target rates ^^f's. 
It finds the point where the boundary is intersected by the ray a (Xj*, • • • In Fig. |3l we plot the 

boundaries of the rate regions achieved by Algorithm A with different decoding orders for a two-user 
MAC with Lt = 2, Lr = 4. It can be observed that the convex hull of the rate regions achieved 
by Algorithm A is the same as the capacity region, which implies that Algorithm A does achieve the 
global optimum for this case, and thus is a low complexity approach to calculate the capacity region for 
MIMO MAC. In Fig. |4l we plot the boundary of the rate region achieved by Algorithm A for a two-user 
interference channel with Lt = 2, Lr = 4. The pseudo optimum boundary achieved by Algorithm A 
is also plotted for comparison. In most places, the two boundaries overlap with each other except for a 
small area. It demonstrates that Algorithm A can find good solutions even with a single initial point. 

We demonstrate the superior convergence speed of Algorithm PR and PRl. Sum power versus iteration 
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Figure 3. Achieved rate region boundaries of a two-user MAC 
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Figure 4. Achieved rate region Boundary of a two-user interference channel 

number is shown in Fig. |5] for the iTree network of Fig. |2] and in Fig. |6] for a 3-user interference channel. 
Each node has four antennas. The target rate for each Unk is set as 5 bits/channel use. In the upper 
sub-plot of Fig. |5j we consider the moderate interference case, where we set gij. = OdB, \/k,l. In the 
lower sub-plot of Fig. \5\ we consider strong interference case, where we set gi^^ = lOdB, Z = 1,2 for 
the interfering links, and gi ^ = OdB for other k, /'s. It is not surprising that Algorithms PR and PRl 
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Figure 5. Convergence of the algorithms for the iTree network in Fig. [2] 
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Figure 6. Convergence of the algorithms for a 3-user interference channel 

have faster convergence speed because polite water-filling exploits the structure of the problem. In the 
upper sub-plot of Fig. [6l we set gi^k = OdB, VA;, /. In the lower sub-plot, we consider a strong interference 
channel, i.e., (7/ ^ = 10dB,VA; 7^ /, and gi i^ = OdB,VA; = /. Again, Algorithm PRl converges faster 
than Algorithm B. Since the problem is non-convex, the algorithms may converge to different stationary 
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Figure 7. Average sum power vs. the total rate for a 4-user MAC 
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Figure 8. Average sum power vs. the total rate for the B-MAC network in Fig. [T] 



points. But it can be observed that all the stationary points achieve similar performance. 

In Fig. |7J we evaluate the performance of Algorithm PR for a 4-user MAC with Lt = 2, L/j = 4. The 
'MSP' is the optimal solution obtained by the 'Algorithm 2' in lITSl . which has much higher complexity 
as discussed in Section II-BI For Algorithm PR, both the order obtained by Algorithm O and the MEB 
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Average Sum Power v.s. the Number of Training Rounds 
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Figure 9. Convergence of the distributed algorithm for a 3-user interference channel 



order is considered. When the target rate for each user is the same, Algorithm PR with both decoding 
orders achieves nearly the same sum power as the MSP but with much lower complexity. When the target 
rates of the users are different and are set as [Rg, 2i?s, 4i?s, 8Rs] /15, where Rs is the total required rate, 
Algorithm PR with the order obtained by Algorithm O still achieves near-optimal performance, while 
Algorithm PR with the MEB order performs worse than that. Not showing is that Algorithm B and 
PRl also achieve the same performance as Algorithm PR. In Fig. [H we evaluate the performance of 
Algorithms B and PRl for the B-MAC network in Fig. [T] with Lt = Lr = 4. The target rates are 
set as 2i?s, 4i?s, 8i?s] /16. The encoding/decoding order is partially fixed and is the same as 

that in Example [T] of Section IIV-GI For the pseudo MAC formed by link 2 and link 3 and the pseudo 
BC formed by link 4 and link 5, the fixed order that X3 is decoded after X2 and X5 is decoded after 
X4, and its improved order obtained by Algorithm O are applied. With the improved order obtained by 
Algorithm O, the performance of Algorithm B is not shown because both algorithms achieve nearly the 
same performance as the Pseudo global optimum of Algorithm PRl, while the algorithms with the fixed 
order suffers a performance loss. 

We illustrate the convergence behavior of the distributed optimization with local CSI. Fig. |9] plots the 
total transmit power and the minimum rate of the users achieved by Algorithm PRD versus the number 
of training rounds for a 3-user interference channel with Lt = Lr = 4. In Algorithm PRD, the target 
rates are set as /3 (if ,12,^3) with /3 = 1 and /3 = 1.05 respectively, where = 5 (bits/channel use). 
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I = 1,2, 3. It can be observed that after 2.5 rounds the rates are close to the targets and after 3.5 rounds, 
the powers are also close to the that of infinite rounds. When /3 = 1, the achieved minimum rate after 3.5 
rounds equals to or exceeds 5 (bits/channel use) in 88 out of 100 simulations. When /3 = 1.05, achieved 
minimum rate after 3.5 rounds equals to or exceeds 5 (bits/channel use) in all 100 simulations, while the 
total transmit power is about 0.7 dB larger. This suggests a trick that use higher target rates than true 
targets in order to satisfy the target rates in fewer number of iterations at the expense of more power. 

VI. Conclusion 

The general MIMO one-hop interference networks named B-MAC networks with Gaussian input 
and any valid coupling matrices are considered. We design algorithms for maximizing the minimum 
of weighted rates under sum power constraints and for minimizing sum power under rate constraints. 
They can be used in admission control and in guaranteeing the quality of service. Two kinds of algorithms 
are designed. The first kind takes advantage of existing SINR optimization algorithms by finding simple 
and optimal mappings between the achievable rate region and the SINR region. The mappings can be 
used for many other optimization problems. The second kind takes advantage of the poUte water-filling 
structure of the optimal input found in fl]. Both centralized and distributed algorithms are designed. The 
proposed algorithms are either proved or shown by simulations to converge to a stationary point, which 
may not be optimal for non-convex cases, but is shown by simulations to be good solutions. 

Appendix 

A. Proof for Theorem^ 

Because each link is equivalent to a single-user channel after whitening the interference plus noise, 
we only need to prove that for any S achieving a rate = log |l + HI]H1^| in a single-user channel 

H, there exists a decomposition of S = TT^ leading to {T, R, p} which achieves a set of SINRs 
7^ = e^(^)/A^ - 1, m = 1, M. 

First, we show that considering unitary precoding matrix V G qMxM j^^^ ^oss generality. Note 
that I (S) = log I + HtVVtttHt = log |l + HVVtfit | , where H = HT is the equivalent channel 
with unitary precoding matrix V = [vi, vjv/]. Define 

-1 



= fit ^ HvivjHt + I^J H 

\i=m+l / 



The SINR of the m^^ stream achieved by the MMSE-SIC receiver is given by |[26 
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Hence, we only need to find a unitary precoding matrix V such that VmA^Vm = e-^^^^/*^ — l,m = 
1, M. Then the precoding matrix for the original channel is give by T = TV. 
We will use the method of induction. We first find a unit vector v jv/ such that vt Ajvf vj\/ — e-^(^)/^^ — x 



Let Aj^*^' be the z*^ largest eigenvalue of Am and u^^^"'' be the corresponding eigenvector. Since X(S) 



(M) 



logll + HVVtHtl 



log |I + Am| = lognf^i (l + aS*^^), we must have A^^ > e 



AM) 



J(S)/Af _ 1 and 



^(M) < ^x(E)/A/ _ i_ Note that v^Amvm = ZZi 
form orthogonal bases, there exists a vm such that 

2 



A 



(Af) 



. Because |u-™\i = l,...,Af| 



0, i = 2,...,M-l, 



,(Af) 



+ 



AM) 



AM) 



„X(S)/Af _ ^ 



Then it follows v], ^Aa/va/ = e^^^)/^^ - 1. 



Assume we already found a set of mutual orthogonal unit vectors vi,l = m + I, M such that 
vjA;V/ = e-^(^)/*^ — 1, 1 = m+1, M. The rest is to prove that there exists a such that vJ„AmVm = 
gX(s)/A/ _ aj^j orthogonal to Vi,Z = m + 1,...,M. Perform SVD A^ = UmDmUL. Let 



aI*"^ be the n^^ largest eigenvalue of A^ and Un"' be the corresponding eigenvector. Define 



Ah 



(m) 



Am) 



U 



(m) 



M 



=m+l J 



V ur'v,-,n = 1,...,M, U, 



(m) ^ (m) 
^1 )---)'^iM 



and Ar 



UmDmUL- Then for 



j = 1, m, we have 



M 



viA^v,=vij]u(r)A(r) 

n=l 
Af 

=v|j]uMA(r) 



u 



(m) 



U 



(m) 



t 



(42) 



n=l 



=vJA„Vj, 

where (l42l ) follows from the definition of ui'"'' and the fact that vj^v^ = 0, i = 1, m, k = m + 1, . 
Because A^ is positive semidefinite and 

AmVj = 0, i = m + 1, M, 



(43) 



the rank of Am must be less than m + 1. Let A^-™^ be the i*'^ largest eigenvalue of A^ and u^"'' be the 
corresponding eigenvector. Then we must have A^-"^^ = 0,i = m + 1, M. Define = [vi, Vm,]. 
Note that the interference from the last M — m streams is J2i=m+i ■ Then the sum rate of the first 



- (m) 
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m Streams is given by 



log 



I + HV„Vj„Ht Hv.vjHt + 

\i=m+l / 



=log I + YlAmY^ 
=log I + Vj„A^V„ 

=log|l + VtA„V 

m 

=iogn(i+A^^ 



(44) 
(45) 



m 
M' 



where ((44)) and (1451) follows from (|42l) and (1431) respectively. Therefore we must have A^'"'* > e-^^^-*/*^ — 1 
and ~xt^ < e^(^)/^ - 1. Note that 



M 



-(m) 



n.=l 



Because |uj^"^\ z = 1, m| form orthogonal bases of the m-dimensional subspace orthogonal to v^, / 
m + 1 , . . . , M, there exits a unit vector v„i in this subspace such that 



t ~(™) 
v' u- 

m I 



0,i = 2,...,m - l,m + 1, ...,M, 



t -(»") 

V Ui 



a('")+ 



2 _ 



AM = e^(^)/^^ - 1. 



Then we have VmA^Vm = e^^"^^/^ — 1. This completes the proof. 



B. Proof for Theorem^ 

Note that E = Yjm=iPrntm^\n impUcs that M > Rank(S). Define an M x M DFT matrix F 



where the element at the k row and / column is Ffe^ = e'^^^ j y M . If M is chosen to be greater 
than or equal to Lt, let Fq G c^txA^ matrix comprised of the first Lt rows of F. Otherwise, 

let Fq G ([^Lry-M jj^g matrix such that the upper sub matrix are F, and other elements are zero. 
Perform SVD Xl = UDU^, where the diagonal elements of D are positive and in descending order. Let 
T = UD^/^Fq. It can be verified that TT^ = The norms of the columns of T are the diagonal 
elements of T^T = FqDFq and they are equal to ^"l^ ''' ■ Then the corresponding transmit powers 
satisfy pm = Tr (S) /M, Mm. 
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C. Proof of the Theorem |S] 

First, we show that any ^i;L satisfying the optimaUty conditions for FOPa must satisfy the KKT 
conditions of FOPa. The Lagrangian of FOPa (l32l) is 

L{^i.,l,vi,@i.,l) (46) 
[Pt - Ef=i Tr m) + Ef=i Tr {^i&i) . 

The KKT conditions are 

Vs,L = 0, Tr {^i&i) = 0, Sz, @i h 0, V^; 
i^i > 0, X?X; (Si^i, *) = Ij^li *) , V/ / 1; (47) 

/^>0, PT = Ef=iTr(5]z). 

Recall zJ; is the polite water-filUng level in the optimality condition. Let n = 'i-/ J2h=i^?^i ^i^^ = IJ^'^fh- 
Then the condition V^^L = can be expressed as 

k^l ' ^ 

By Theorem [3l can be expressed as 

±1 = i)i {ill^ - ((li + H;,;EiH|) j , VL (49) 



Hfe, + I 



(48) 



Substitute Xli;/, into condition (1481 ) to obtain 

hi = (Qi + Ui^itiHiy^ Ui^i + io;, V/. (50) 

Because the KKT condition dSOl ) is also that of the single user polite water-filling problem over the 

~ —1/2 ~ —1/2 ~ 

channel Cl^ ^fi^ and by the optimality condition, 1]i has polite water-filling structure over this 



channel, ^i;L satisfies condition (1501) . It can be verified that Si:L also satisfies all other KKT conditions 
in dlT]). 

With a similar proof as above, one can show that for SPMP, the necessary optimality conditions also 
implies that the KKT conditions hold. 

The sufficient part for FOPa is proved by showing that the optimum of FOPa is equal to the optimum 
of some weighted sum rate maximization problem (WSRMP) and the KKT conditions are sufficient for 
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global optimality when the weighted sum rate J^i" ^i^i (^i:L) is a convex function. Suppose certain 



Tii-L satisfies the optimality conditions for FOPa. Then following similar steps from (1461) to (pQl) . it can 
be shown that Si:^ satisfies the KKT conditions of the following WSRMP. 



WSRMP:max (51) 

^'■^ 1=1 

L 

s.t.E; ^ 0, / = 1, • • • , L and J]] Tr (S/) < Pt, 

1=1 

where {i>i} are the polite water-filling levels corresponding to "Si-.l- 

Because i/^X; (Si-/^, is a concave function of "Ei-i, the KKT conditions are sufficient for 

the global optimaUty of WSRMP. Noting that 2/ (jli-^,^^ = al^,yi. If S^l is not an optimum of 
FOPa, there exists a S^.^ satisfying all the constraints, such that Xi /Z^ > a, from which 

it follows that Ii /I^ > a,\/l since T,'^.^ satisfy the first constraint in FOPa. Then we must 

have ^i^i ^) > J2i=i = Yld=i ^i^i (^1:-;^) which contradicts the fact that ^x-l is 

an optimum of WSRMP. 

Similarly, using KKT conditions, the sufficient part for SPMP can be proved by showing that is 
the global optimum of the following convex optimization problem 



min^Tr(SO 

1=1 

L 

s.t.Sj ^ 0, Z = 1, • • • , L and m (Si:L, ^) >Y. "^^^i 



L 


i 

1=1 1=1 
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